AITopics

2605.11841

Country: North America > United States > California (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.89)

Husar, Kateryna, Volfovsky, Alexander

DARTS: Targeting Prognostic Covariates in Budget-Constrained Sequential Experiments

arXiv.org Machine LearningMay-8-2026

Randomized controlled trials typically assume that prognostic covariates are known and available at no cost. In practice, obtaining high-dimensional pretreatment data is costly, forcing a trade-off between covariate-adaptive precision and a measurement budget. We introduce Dynamic Adaptive Rerandomization via Thompson Sampling (DARTS), which treats covariate acquisition as a sequential optimization problem embedded within a design-based causal inference task. A budgeted combinatorial Thompson sampler learns which covariates are most prognostic across successive batches; selected covariates then drive rerandomization and regression adjustment to reduce batch-level average treatment effect variance. Our primary theoretical contribution is a decoupling result: adaptive covariate selection based on past batches preserves batch-level randomization validity, and the cumulative inverse-variance weighted estimator achieves at least nominal asymptotic coverage. We further derive a Bayes risk bound for the acquisition layer that matches the minimax lower bound up to logarithmic factors. Empirically, DARTS systematically concentrates the budget on informative features, significantly closing the efficiency gap to oracle designs while maintaining strict inferential validity.

artificial intelligence, covariate, machine learning, (18 more...)

2605.06608

Genre:

Research Report > Strength High (1.00)
Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Hundrieser, Shayan, Kong, Insung, Schmidt-Hieber, Johannes

Hyper Input Convex Neural Networks for Shape Constrained Learning and Optimal Transport

arXiv.org Machine LearningApr-30-2026

We introduce Hyper Input Convex Neural Networks (HyCNNs), a novel neural network architecture designed for learning convex functions. HyCNNs combine the principles of Maxout networks with input convex neural networks (ICNNs) to create a neural network that is always convex in the input, theoretically capable of leveraging depth, and performs reliable when trained at scale compared to ICNNs. Concretely, we prove that HyCNNs require exponentially fewer parameters than ICNNs to approximate quadratic functions up to a given precision. Throughout a series of synthetic experiments, we demonstrate that HyCNNs outperform existing ICNNs and MLPs in terms of predictive performance for convex regression and interpolation tasks. We further apply HyCNNs to learn high-dimensional optimal transport maps for synthetic examples and for single-cell RNA sequencing data, where they oftentimes outperform ICNN-based neural optimal transport methods and other baselines across a wide range of settings.

artificial intelligence, hycnn, machine learning, (19 more...)

2604.26942

Country: North America > United States (0.45)

Genre: Research Report (0.81)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Oncology (0.92)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Neural Information Processing SystemsApr-26-2026, 21:21:11 GMT

Supplementary Material for HUMUS Net Hybrid Unrolled Multi Scale Network Architecture for Accelerated Net baseline details

Our default model has 3RSTB-D downsampling blocks, 2RSTB-B bottleneck blocks and 3RSTB-U upsampling blocks with 3 6 12 attention heads in the D/U blocks and 24 attention heads in the bottleneck block. For Swin Transformers layers, the window size is 8 for all methods and MLP ratio (hidden_dim/input_dim) of 2 is used. Each RSTB block consists of 2 STLs with embedding dimension of 66. For HUMUS-Net-L, we increase the embedding dimension to 96. We use 8cascades of unrolling with a U-Net as sensitivity map estimator (same as in E2E-VarNet) with 16channels.

artificial intelligence, machine learning, natural language, (17 more...)

Genre: Research Report (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.70)

Neural Information Processing SystemsApr-24-2026, 17:09:42 GMT

Results

In this section we prove the theoretical results around the dual curriculum game and use these results to show approximation bounds for our methods, given that they have reached a Nash equilibrium (NE). The first theorem is the main result that allows us to analyze dual curriculum games. The high-level result says that the NE of a dual curriculum game are approximate NE of the base game from the perspective of any of the individual players, or from the perspective of the joint strategy. Let Bbe the maximum difference between U1t and U2t, and let (π,θ1,θ2) be a NE for G. Then (π,pθ1 + (1 p)θ2) is an approximate NE for the base game with either teacher or for a teacher optimizing their joint objective. More precisely, it is a 2Bp(1 p)-approximate NE when Ut = pU1t + (1 p)U2t, a 2B(1 p)-approximate NE when Ut = U1t, and a 2Bp-approximate NE when Ut = U2t. At a high level, this is true because, for low values of p, the best-response strategies for the individual players can be thought of as approximate-best response strategies for the joint-player, and vis-versa. Since the Nash Equilibrium consists of each of the players playing their own best response, they must be playing an approximate best response for the joint-player. We provide a formal proof below: Proof. Let B be the maximum difference between U1t and U2t, and let (π,θ1,θ2) be a Nash Equilibrium for G. Then consider pθ1 + (1 p)θ2 as a strategy in the base game for the joint player pU1t + (1 p)U2t.

agent, artificial intelligence, machine learning, (19 more...)

Country:

Europe (1.00)
Asia > China (0.14)

Genre: Research Report > New Finding (0.46)

Industry:

Leisure & Entertainment > Sports > Motorsports > Formula One (1.00)
Leisure & Entertainment > Games (0.74)

Technology:

Information Technology > Game Theory (0.90)
Information Technology > Artificial Intelligence > Machine Learning (0.49)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.48)

Neural Information Processing SystemsApr-24-2026, 12:10:50 GMT

0678c572b0d5597d2d4a6b5bd135754c-Supplemental.pdf

artificial intelligence, machine learning, optimization problem, (19 more...)

Country: North America > United States (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Neural Information Processing SystemsApr-24-2026, 09:48:55 GMT

00b76fddeaaa7d8c2c43d504b2babd8a-Supplemental.pdf

artificial intelligence, bast, machine learning, (18 more...)

Country: North America > United States > Texas (0.16)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)

Neural Information Processing SystemsApr-24-2026, 08:15:33 GMT

05b63fa06784b71aab3939004e0f0a0d-Supplemental-Conference.pdf

artificial intelligence, domain randomization, machine learning, (13 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Machine LearningApr-9-2026

CRPS-Optimal Binning for Univariate Conformal Regression

Toccaceli, Paolo

We propose a method for non-parametric conditional distribution estimation based on partitioning covariate-sorted observations into contiguous bins and using the within-bin empirical CDF as the predictive distribution. Bin boundaries are chosen to minimise the total leave-one-out Continuous Ranked Probability Score (LOO-CRPS), which admits a closed-form cost function with $O(n^2 \log n)$ precomputation and $O(n^2)$ storage; the globally optimal $K$-partition is recovered by a dynamic programme in $O(n^2 K)$ time. Minimisation of within-sample LOO-CRPS turns out to be inappropriate for selecting $K$ as it results in in-sample optimism. We instead select $K$ by $K$-fold cross-validation of test CRPS, which yields a U-shaped criterion with a well-defined minimum. Having selected $K^*$ and fitted the full-data partition, we form two complementary predictive objects: the Venn prediction band and a conformal prediction set based on CRPS as the nonconformity score, which carries a finite-sample marginal coverage guarantee at any prescribed level $\varepsilon$. The conformal prediction is transductive and data-efficient, as all observations are used for both partitioning and p-value calculation, with no need to reserve a hold-out set. On real benchmarks against split-conformal competitors (Gaussian split conformal, CQR, CQR-QRF, and conformalized isotonic distributional regression), the method produces substantially narrower prediction intervals while maintaining near-nominal coverage.

artificial intelligence, machine learning, prediction, (19 more...)